NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Performance measurements of the electromagnetic calorimeter and readout electronics system for the DarkQuest experiment

https://doi.org/10.1016/j.nima.2025.170792

Apyan, Aram; Cosby, Christopher; Feng, Yongbin; Gelgen, Alp; Gori, Stefania; Harris, Philip; Liu, Xinlong; Liu, Mia; Maksimovic, Petar; Mantilla-Suarez, Cristina; et al (November 2025, Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment)

Free, publicly-accessible full text available November 1, 2026
SuperSONIC: Cloud-Native Infrastructure for ML Inferencing

https://doi.org/10.1145/3708035.3736049

Kondratyev, Dmitry; Riedel, Benedikt; Chou, Yuan-Tang; Cochran-Branson, Miles; Paladino, Noah; Schultz, David; Liu, Mia; Duarte, Javier; Harris, Philip; Hsu, Shih-Chieh (July 2025, ACM)

The increasing computational demand from growing data rates and complex machine learning (ML) algorithms in large-scale scientific experiments has driven the adoption of the Services for Optimized Network Inference on Coprocessors (SONIC) approach. SONIC accelerates ML inference by offloading it to local or remote coprocessors to optimize resource utilization. Leveraging its portability to different types of coprocessors, SONIC enhances data processing and model deployment efficiency for cutting-edge research in high energy physics (HEP) and multi-messenger astrophysics (MMA). We developed the SuperSONIC project, a scalable server infrastructure for SONIC, enabling the deployment of computationally intensive tasks to Kubernetes clusters equipped with graphics processing units (GPUs). Using NVIDIA Triton Inference Server, SuperSONIC decouples client workflows from server infrastructure, standardizing communication, optimizing throughput, load balancing, and monitoring. SuperSONIC has been successfully deployed for the CMS and ATLAS experiments at the CERN Large Hadron Collider (LHC), the IceCube Neutrino Observatory (IceCube), and the Laser Interferometer Gravitational-Wave Observatory (LIGO) and tested on Kubernetes clusters at Purdue University, the National Research Platform (NRP), and the University of Chicago. SuperSONIC addresses the challenges of the Cloud-native era by providing a reusable, configurable framework that enhances the efficiency of accelerator-based inference deployment across diverse scientific domains and industries.
more » « less
Free, publicly-accessible full text available July 18, 2026
Interpretable Geometric Deep Learning via Learnable Randomness Injection

Miao, Siqi; Luo, Yunan; Liu, Mia; Li, Pan (February 2023, ICLR/OpenReview.net)

Full Text Available
Interpretable and Generalizable Graph Learning via Stochastic Attention Mechanism

Miao, Siqi; Liu, Mia; Li, Pan (July 2022, Proceedings of Machine Learning Research)
Chaudhuri, Kamalika and (Ed.)
Interpretable graph learning is in need as many scientific applications depend on learning models to collect insights from graph-structured data. Previous works mostly focused on using post-hoc approaches to interpret pre-trained models (graph neural networks in particular). They argue against inherently interpretable models because the good interpretability of these models is often at the cost of their prediction accuracy. However, those post-hoc methods often fail to provide stable interpretation and may extract features that are spuriously correlated with the task. In this work, we address these issues by proposing Graph Stochastic Attention (GSAT). Derived from the information bottleneck principle, GSAT injects stochasticity to the attention weights to block the information from task-irrelevant graph components while learning stochasticity-reduced attention to select task-relevant subgraphs for interpretation. The selected subgraphs provably do not contain patterns that are spuriously correlated with the task under some assumptions. Extensive experiments on eight datasets show that GSAT outperforms the state-of-the-art methods by up to 20% in interpretation AUC and 5% in prediction accuracy. Our code is available at https://github.com/Graph-COM/GSAT. https://arxiv.org/abs/2201.12987 https://proceedings.mlr.press/v162/miao22a.html
more » « less
Full Text Available
Interpretable and Generalizable Graph Learning via Stochastic Attention Mechanism

Miao, Siqi; Liu, Mia; Li, Pan (June 2022, Interpretable and Generalizable Graph Learning via Stochastic Attention Mechanism)

Full Text Available
GPU coprocessors as a service for deep learning inference in high energy physics

https://doi.org/10.1088/2632-2153/abec21

Krupa, Jeffrey; Lin, Kelvin; Acosta Flechas, Maria; Dinsmore, Jack; Duarte, Javier; Harris, Philip; Hauck, Scott; Holzman, Burt; Hsu, Shih-Chieh; Klijnsma, Thomas; et al (April 2021, Machine Learning: Science and Technology)
null (Ed.)
Full Text Available
Fast convolutional neural networks on FPGAs with hls4ml

https://doi.org/10.1088/2632-2153/ac0ea1

Aarrestad, Thea; Loncar, Vladimir; Ghielmetti, Nicolò; Pierini, Maurizio; Summers, Sioni; Ngadiuba, Jennifer; Petersson, Christoffer; Linander, Hampus; Iiyama, Yutaro; Di Guglielmo, Giuseppe; et al (July 2021, Machine Learning: Science and Technology)
null (Ed.)
Full Text Available
FPGAs-as-a-Service Toolkit (FaaST)

Rankin, Dylan; Krupa, Jeffrey; Harris, Philip; Flechas, Maria; Holzman, Burt; Klijnsma, Thomas; Pedro, Kevin; Tran, Nhan; Hauck, Scott; Hsu, Shih-Chieh; et al (October 2020, ArXivorg)
null (Ed.)
Computing needs for high energy physics are already intensive and are expected to increase drastically in the coming years. In this context, heterogeneous computing, specifically as-a-service computing, has the potential for significant gains over traditional computing models. Although previous studies and packages in the field of heterogeneous computing have focused on GPUs as accelerators, FPGAs are an extremely promising option as well. A series of workflows are developed to establish the performance capabilities of FPGAs as a service. Multiple different devices and a range of algorithms for use in high energy physics are studied. For a small, dense network, the throughput can be improved by an order of magnitude with respect to GPUs as a service. For large convolutional networks, the throughput is found to be comparable to GPUs as a service. This work represents the first open-source FPGAs-as-a-service toolkit.
more » « less
Full Text Available
FPGAs-as-a-Service Toolkit (FaaST)

https://doi.org/10.1109/H2RC51942.2020.00010

Rankin, Dylan; Krupa, Jeffrey; Harris, Philip; Flechas, Maria Acosta; Holzman, Burt; Klijnsma, Thomas; Pedro, Kevin; Tran, Nhan; Hauck, Scott; Hsu, Shih-Chieh; et al (November 2020, 2020 IEEE/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC))
null (Ed.)
Computing needs for high energy physics are already intensive and are expected to increase drastically in the coming years. In this context, heterogeneous computing, specifically as-a-service computing, has the potential for significant gains over traditional computing models. Although previous studies and packages in the field of heterogeneous computing have focused on GPUs as accelerators, FPGAs are an extremely promising option as well. A series of workflows are developed to establish the performance capabilities of FPGAs as a service. Multiple different devices and a range of algorithms for use in high energy physics are studied. For a small, dense network, the throughput can be improved by an order of magnitude with respect to GPUs as a service. For large convolutional networks, the throughput is found to be comparable to GPUs as a service. This work represents the first open-source FPGAs-as-a-service toolkit.
more » « less
Full Text Available
Accelerated Charged Particle Tracking with Graph Neural Networks on FPGAs

Heinz, Aneesh; Razavimaleki, Vasall; Duarte, Javier; DeZoort, Gage; Ojalvo, Isobel; Thais, Savannah; Atkinson, Markus; Neubauer, Mark; Gray, Lindsey; Jindariani, Sergo; et al (November 2020, ArXivorg)
null (Ed.)
We develop and study FPGA implementations of algorithms for charged particle tracking based on graph neural networks. The two complementary FPGA designs are based on OpenCL, a framework for writing programs that execute across heterogeneous platforms, and hls4ml, a high-level-synthesis-based compiler for neural network to firmware conversion. We evaluate and compare the resource usage, latency, and tracking performance of our implementations based on a benchmark dataset. We find a considerable speedup over CPU-based execution is possible, potentially enabling such algorithms to be used effectively in future computing workflows and the FPGA-based Level-1 trigger at the CERN Large Hadron Collider.
more » « less
Full Text Available

« Prev Next »

Search for: All records